ShAPO: Implicit Representations for Multi-object Shape, Appearance, and Pose Optimization
نویسندگان
چکیده
Our method studies the complex task of object-centric 3D understanding from a single RGB-D observation. As it is an ill-posed problem, existing methods suffer low performance for both shape and 6D pose size estimation in multi-object scenarios with occlusions. We present ShAPO, joint detection, textured reconstruction, object estimation. Key to ShAPO single-shot pipeline regress shape, appearance latent codes along masks each instance, which then further refined sparse-to-dense fashion. A novel disentangled database priors first learned embed objects their respective space. also propose novel, octree-based differentiable optimization step, allowing us improve simultaneously under space, analysis-by-synthesis implicit representation allows accurately identify reconstruct unseen without having access meshes. Through extensive experiments, we show that our method, trained on simulated indoor scenes, regresses real-world minimal fine-tuning. significantly out-performs all baselines NOCS dataset 8% absolute improvement mAP
منابع مشابه
Shape Representations for Object Recognition
The problem of object recognition has been at the forefront of computer vision research in the last decade. The most successful approaches have used mainly edgeor texture-based representations. The shape of the object outline, albeit widely used for pre-segmented objects, has found limited applicability to the detection problem in real images. The fact that shape is a truly holistic global perc...
متن کاملObject Discovery through Motion, Appearance and Shape
We examine the problem of Object Discovery, the autonomous acquisition of object models, using a combination of shape, appearance and motion. We propose a novel multi-stage technique for detecting rigidly moving objects and modeling their appearance for recognition. First, a stereo camera is used to acquire a sequence of images and depth maps of a given scene. Then the scene is oversegmented us...
متن کاملPose Estimation and Object Identiication Using Complex Algebraic Representations Pose Estimation and Object Identiication Using Complex Algebraic Representations
The comparison and alignment of two similar objects is a fundamental problem in pattern recognition and computer vision that has been considered using various approaches. In this work, we employ a complex representation for an algebraic curve, and illustrate how the algebraic transformation which relates two Euclidean equivalent curves can be determined using this representation. The idea is ba...
متن کاملModeling Pose/Appearance Relations for Improved Object Localization and Pose Estimation in 2D images
We propose a multiview model of appearance of objects that explicitly represents their variations of appearance with respect to their 3D pose. This results in a probabilistic, generative model capable of precisely synthesizing novel views of the learned object in arbitrary poses, not limited to the discrete set of trained viewpoints. We show how to use this model on the task of localization and...
متن کاملTextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation
This paper proposes a new approach to learning a discriminative model of object classes, incorporating appearance, shape and context information efficiently. The learned model is used for automatic visual recognition and semantic segmentation of photographs. Our discriminative model exploits novel features, based on textons, which jointly model shape and texture. Unary classification and featur...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-20086-1_16